Rank in Wordlist | Frequency | Word |
---|---|---|
3808 | 40 | 10,000 |
6508 | 24 | 100,000 |
6511 | 24 | 2,000 |
6788 | 23 | 3,000 |
7409 | 21 | 20,000 |
7779 | 20 | 1,000 |
10258 | 15 | 30,000 |
10949 | 14 | 5,000 |
11717 | 13 | 40,000 |
12599 | 12 | 1,500 |
Rank in Wordlist | Frequency | Word |
---|---|---|
26679 | 5 | 7–6(5 |
26760 | 5 | V(D)J |
31910 | 4 | 7–6(6 |
42361 | 3 | ಕಟ್(ಮುಂದಿನ |
52162 | 3 | ಸೆಡಿಮೆಂಟ್(ನೀರಿನಡಿ |
54208 | 2 | 7–6(3 |
54616 | 2 | LASIK(ಲಸಿಕ್ |
54729 | 2 | O(N |
54903 | 2 | U.S.(ಯು.ಎಸ್ |
54933 | 2 | UV(ನೇರಳಾತೀತ |
Rank in Wordlist | Frequency | Word |
---|---|---|
22813 | 7 | ಹೆಸರು)ಯು |
26760 | 5 | V(D)J |
39756 | 3 | 2001)ನಲ್ಲಿ |
53835 | 2 | 1998)ಮತ್ತು |
53872 | 2 | 2002)ನಲ್ಲಿ |
54335 | 2 | C)ನಷ್ಟಿರುತ್ತದೆ |
54336 | 2 | C)ರಷ್ಟು |
54339 | 2 | C210)ಯು |
54831 | 2 | SCI) |
59819 | 2 | ಒಕ್ಕೂಟ)ದ |
Rank in Wordlist | Frequency | Word |
---|---|---|
16150 | 9 | 70%ನಷ್ಟು |
17879 | 8 | 10%ನಷ್ಟು |
17923 | 8 | 40%ರಷ್ಟು |
17926 | 8 | 50%ನಷ್ಟು |
20150 | 7 | 5%ರಷ್ಟು |
22897 | 6 | 10%ರಷ್ಟು |
22958 | 6 | 20%ನಷ್ಟು |
22982 | 6 | 90%ನಷ್ಟು |
26549 | 5 | 1%ನಷ್ಟು |
26569 | 5 | 15%ರಷ್ಟು |
Rank in Wordlist | Frequency | Word |
---|---|---|
282313 | 1 | ೪೦&೫೦ |
Rank in Wordlist | Frequency | Word |
---|---|---|
11733 | 13 | US$ನಷ್ಟು |
32050 | 4 | US$1 |
32051 | 4 | US$10 |
54920 | 2 | US$100 |
54921 | 2 | US$150 |
54922 | 2 | US$2 |
54923 | 2 | US$3 |
54924 | 2 | US$35 |
86790 | 1 | 10,000$ನಷ್ಟು |
89549 | 1 | 25,000$ನಷ್ಟು |
Rank in Wordlist | Frequency | Word |
---|---|---|
40985 | 3 | ಆಟಗಾರ"ರಿಗೆ |
45730 | 3 | ನಗರ"ವೆಂದು |
49571 | 3 | ಯು"ವನ್ನು |
51299 | 3 | ಸಂಘ"ದ |
51955 | 3 | ಸಿದ್ಧಾಂತ"ದ |
55453 | 2 | ಅಚ್ಚರಿಗಳ"ಲ್ಲಿ |
63195 | 2 | ಗಣ"ವೆಂದೂ |
63552 | 2 | ಗುಣ",ಅಂಕಿಸಂಖ್ಯಾಶಾಸ್ತ್ರ |
66235 | 2 | ತಂಡ"ವೆಂದು |
71409 | 2 | ಪ್ರದರ್ಶನ"ವು |
Rank in Wordlist | Frequency | Word |
---|---|---|
92472 | 1 | Ballon d'Or |
92518 | 1 | Bishop's Stortford |
92547 | 1 | Britain's Got Talent |
94601 | 1 | King's Lynn |
95003 | 1 | Ma'adim Vallis |
95138 | 1 | Muséum national d'histoire naturelle |
95403 | 1 | New Year's |
96733 | 1 | The Cat's Meow |
Rank in Wordlist | Frequency | Word |
---|---|---|
55003 | 2 | ZIP+4 |
88880 | 1 | 2+2 |
92575 | 1 | C++/CLI |
92576 | 1 | C++ನ |
96805 | 1 | U+00B7 |
96806 | 1 | U+1F00 |
96807 | 1 | U+1FFF |
96808 | 1 | U+2660 |
96809 | 1 | U+2667 |
97202 | 1 | W+Kಯ |
Rank in Wordlist | Frequency | Word |
---|---|---|
54244 | 2 | A*- |
87455 | 1 | 150*75 |
87953 | 1 | 18*24 |
89511 | 1 | 24*36 |
94223 | 1 | II*-ಎಂದು |
114934 | 1 | ಇ*ಟ್ರೇಡ್ |
228435 | 1 | ಯಹೂದಿ*ವಿರೋಧಿ |
Rank in Wordlist | Frequency | Word |
---|---|---|
3991 | 39 | ಮತ್ತು/ಅಥವಾ |
7923 | 20 | ಡ್ಯುನೆಡಿನ್/ಡ್ಯೂನ್ಡಿನ್ |
9743 | 16 | ಕನ್ಸಾಸ್/ಕಾನ್ಸಾಸ್ |
16298 | 9 | ಆತಿಥ್ಯ/ಅತಿಥಿ |
16329 | 9 | ಇಂಗಾಲೀಯ/ಜೈವಿಕ/ಸಾವಯವ |
16704 | 9 | ಜಠರ/ಜಠರೀಯ |
17964 | 8 | mmol/L |
18687 | 8 | ಡಬಲ್ಸ್/ಜೋಡಿ/ಯುಗಳ |
20183 | 7 | Mbit/s |
20200 | 7 | mg/dL |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots